Reinforcement Learning-Based Load Shared Sequential Routing

نویسندگان

  • Fariba Heidari
  • Shie Mannor
  • Lorne Mason
چکیده

We consider event dependent routing algorithms for on-line explicit source routing in MPLS networks. The proposed methods are based on load shared sequential routing in which load sharing factors are updated using learning algorithms. The learning algorithms we employ are either based on learning automata or on online learning algorithms that were originally devised for solving the adversarial multi-armed bandit problem. While simple to implement, the performance of the proposed learning algorithms in terms of blocking probability compares favorably with the performance of other event dependent routing methods proposed for MPLS routing such as the Success-to-the-top algorithm. We demonstrate the convergence of one of the learning algorithms to the user equilibrium within a set of discrete event simulations.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Load Shared Sequential Routing in MPLS Networks: System and User Optimal Solutions

Recently Gerald Ash has shown through case studies that event dependent routing is attractive in large scale multi-service MPLS networks. In this paper, we consider the application of Load Shared Sequential Routing (LSSR) in MPLS networks where the load sharing factors are updated using reinforcement learning techniques. We present algorithms based on learning automata techniques for optimizing...

متن کامل

Predictive Q-Routing: A Memory-based Reinforcement Learning Approach to Adaptive Traffic Control

In this paper, we propose a memory-based Q-Iearning algorithm called predictive Q-routing (PQ-routing) for adaptive traffic control. We attempt to address two problems encountered in Q-routing (Boyan & Littman, 1994), namely, the inability to fine-tune routing policies under low network load and the inability to learn new optimal policies under decreasing load conditions. Unlike other memory-ba...

متن کامل

Performance and Analysis of Spot Truck-load Procurement Markets Using Sequential Auctions

Title of Dissertation / Thesis: PERFORMANCE AND ANALYSIS OF SPOT TRUCK-LOAD PROCUREMENT MARKETS USING SEQUENTIAL AUCTIONS Miguel Andres Figliozzi, Ph.D., 2004 Dissertation / Thesis Directed By: Professor Hani Mahmassani, Civil and Environmental Engineering Department Competition in a transportation marketplace is studied under different supply/demand conditions, auction formats, and carriers’ b...

متن کامل

Multicast Routing in Wireless Sensor Networks: A Distributed Reinforcement Learning Approach

Wireless Sensor Networks (WSNs) are consist of independent distributed sensors with storing, processing, sensing and communication capabilities to monitor physical or environmental conditions. There are number of challenges in WSNs because of limitation of battery power, communications, computation and storage space. In the recent years, computational intelligence approaches such as evolutionar...

متن کامل

Predictive Q-routing: a Memory-based Reinforcement Learning Approach to Adaptive Traac Control

In this paper, we propose a memory-based Q-learning algorithm called predictive Q-routing (PQ-routing) for adaptive traac control. We attempt to address two problems encountered in Q-routing (Boyan & Littman, 1994), namely, the inability to ne-tune routing policies under low network load and the inability to learn new optimal policies under decreasing load conditions. Unlike other memory-based ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007